Database selection for forensic voice comparison
Abstract
Defining the relevant population to sample is an important issue in data-based implementation of the likelihood-ratio framework for forensic voice comparison. We present a logical argument that because an investigator or prosecutor only submits suspect and offender recordings for forensic analysis if they sound sufficiently similar to each other, the appropriate defense hypothesis for the forensic scientist to adopt will usually be that the suspect is not the speaker on the offender recording but is a member of a population of speakers who sound sufficiently similar that an investigator or prosecutor would submit recordings of these speakers for forensic analysis. We propose a procedure for selecting background, development, and test databases using a panel of human listeners, and empirically test an automatic procedure inspired by the above. Although the automatic procedure is not entirely consistent with the logical arguments and human-listener procedure, it serves as a proof of concept for the importance of database selection. A forensic-voice-comparison system using the automatic database-selection procedure outperformed systems with random database selection.
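The role of the defense hypothesis can be made concrete with the standard form of the likelihood ratio. This is a sketch in conventional notation; the symbols E, H_p, and H_d are assumptions introduced here for illustration and do not appear in the abstract.

```latex
% E   : the evidence (measured properties of the suspect and offender recordings)
% H_p : prosecution hypothesis, the suspect is the speaker on the offender recording
% H_d : defense hypothesis, the speaker on the offender recording is some other
%       member of the relevant population (here, speakers who sound sufficiently
%       similar that their recordings would be submitted for analysis)
\[
  \mathrm{LR} = \frac{p(E \mid H_p)}{p(E \mid H_d)}
\]
```

The denominator is estimated from the background database, so the choice of relevant population determines which speakers that database should contain.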
Similar resources
Humans versus Machine: Forensic Voice Comparison on a Small Database of Swedish Voice Recordings
A procedure for comparing the performance of humans and machines on speaker recognition and on forensic voice comparison is proposed and demonstrated. The procedure is consistent with the new paradigm for forensic-comparison science (use of the likelihood-ratio framework and testing of the validity and reliability of the results). The use of the procedure is demonstrated using a small database ...
Assessing the Admissibility of a New Generation of Forensic Voice Comparison Testimony
This article provides a primer on forensic voice comparison (aka forensic speaker recognition), a branch of forensic science in which the forensic practitioner analyzes a voice recording in order to provide an expert opinion that will help the trier-of-fact determine the identity of the speaker. The article begins with an explanation of ways in which human speech varies within and between speak...
Replicate Mismatch between Test and Background/Development Databases: The Effect on the Performance of Likelihood Ratio-based Forensic Voice Comparison
This study reports the extent to which a mismatch in the number of within-speaker replicates between the test and background/development databases affects the performance of a forensic voice comparison (FVC) system. FVC tests are repeatedly carried out using the Monte Carlo simulation technique with temporal MFCC features and the Multivariate Kernel Density Likelihood Ratio Procedure. The perform...
FABIOLE, a Speech Database for Forensic Speaker Comparison
A speech database has been collected to highlight the importance of the “speaker factor” in forensic voice comparison. FABIOLE was created during the FABIOLE project, funded by the French Research Agency (ANR) from 2013 to 2016. This corpus consists of more than 3 thousand excerpts spoken by 130 French native male speakers. The speakers are divided into two categories: 30 target speake...
What is the Relevant Population? Considerations for the Computation of Likelihood Ratios in Forensic Voice Comparison
In forensic voice comparison, it is essential to consider not only the similarity between samples, but also the typicality of the evidence in the relevant population. This is explicit within the likelihood ratio (LR) framework. A significant issue, however, is the definition of the relevant population. This paper explores the complexity of population selection for voice evidence. We evaluate th...